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Abstract — This work considers communication networks wliere 
individual links can be described as MIMO channels. Unlike 
orthogonal modulation methods (such as the singular-value de- 
composition), we allow interference between sub-channels, which 
can be removed by the receivers via successive cancellation. 
The degrees of freedom earned by this relaxation are used for 
obtaining a basis which is simultaneously good for more than one 
link. Specifically, we derive necessary and sufficient conditions for 
shaping the ratio vector of sub-channel gains of two broadcast- 
channel receivers. We then apply this to two scenarios: First, 
in digital multicasting we present a practical capacity-achieving 
scheme which only uses scalar codes and linear processing. Then, 
we consider the joint source-channel problem of transmitting a 
Gaussian source over a two-user MIMO channel, where we show 
the existence of non-trivial cases, where the optimal distortion 
pair (which for high signal-to-noise ratios equals the optimal 
point-to-point distortions of the individual users) may be achieved 
by employing a hybrid digital-analog scheme over the induced 
equivalent channel. These scenarios demonstrate the advantage 
of choosing a modulation basis based upon multiple links in the 
network, thus we coin the approach "network modulation". 

Index Terms — Broadcast channel, MIMO, multicasting, gener- 
alized triangular decomposition, geometric mean decomposition, 
GSVD, GDFE, multiplicative majorization, joint source-channel 
coding. 

I. Introduction 

The choice of modulation domain plays a major role in 
communication, both in deriving performance limits and in 
the design of practical schemes which decouple the signal 
processing task of channel equalization from coding. Thus, 
choosing the "right" basis is of central importance. For ex- 
ample, the capacity of the Gaussian inter-symbol interference 
(ISI) channel is given by the water-filling solution, applied in 
the frequency domain; the same transformation also allows to 
use popular schemes such as Orthogonal Frequency-Division 
Multiplexing (OFDM) which employs the discrete Fourier 
transform. The singular value decomposition (SVD) plays a 
similar role for multiple-input multiple-output (MIMO) chan- 
nels. Common to both cases is diagonalization: They yield 
parallel independent equivalent channels. But do we really 
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need such orthogonality? Capacity can be achieved for both 
the ISI and MIMO channels using non-orthogonal equivalent 
channels, by a receiver which performs triangularization of 
the channeU (rather than diagonalization) and then decision- 
feedback equahzation or successive interference cancellation 
(SIC), see e.g. HI. This is done without performing any 
transformation at the transmitter It is therefore natural to 
ask, what can be achieved by allowing both a transmitter 
transformation (in addition to the receiver one) and SIC. 

One such direction, pursued by Jiang, Hager and Li [2|, is 
the generalized triangular decomposition (GTD): A matrix A 
is decomposed as 

A = UTV'^ , 

where U and V are unitary matrices, denotes the conjugate 
transpose of V and T is upper-triangular with a prescribed 
diagonal. It is shown in Isl, ||4| that the transforming matrices 
U and V exist if and only if the (desired) diagonal elements 
of T obey Weyl's multiplicative majorization relation with the 
singular values of A (see also ||5l). Since the product of these 
diagonal elements equals the product of the singular values of 
A, the decomposition performs diagonal shaping: It distributes 
the total gain between the diagonal elements in a desired way. 
An important special case is where balanced gains are sought, 
i.e., the diagonal elements of T should all be equal. In that 
case, named the geometric mean decomposition (GMD) ||6l, 
the majorization condition holds for any A. When applied to 
MIMO communication, GMD has an advantage over SVD, 
that all subchannels enjoy the same gain, and thus may support 
codebooks of the same rate, avoiding the need for a bit-loading 
mechanism. This comes at the price of performing SIC at the 
receiver. The GMD has received considerable attention; see, 
e-g-, Q-im for some of its applications. 

We take a different path, in which we wish to jointly 
shape the diagonals of two matrices, for the purpose of multi- 
terminal communication. Since with this approach the choice 
of basis depends upon more than one communication link, 
we call it network modulation. We jointly triangularize two 
matrices Ai and A2 as 

A,^U,T,V\ « = 1,2, (1) 

where Ui, U2 and V are unitary and Ti and T2 are upper- 
triangular. Having the same matrix V on one of the sides of 

'Outside the high signal-to-noise ratio regime, "near triangularization" is 
performed as an optimal balance between residual interference and noise. 



the decomposition corresponds to applying the same trans- 
formation, and is thus suitable to two links originating (or 
terminating) at the same node. It turns out that, in different 
network applications, it is important to shape the vector of 
ratios between the diagonals. We show that the sufficient 
and necessary condition for achievability of a ratio vector 
is a multiplicative majorization relation with the generalized 
singular values JTOl of the pair (^1,^2). 

We start by deriving the necessary and sufficient condi- 
tions for joint unitary triangularization of two matrices, in 
Section III-BI In the rest of the paper we apply this result in 
two different scenarios, where in one we present an optimal 
practical scheme for a problem for which the capacity is 
known, and in the second we derive the (hitherto unknown) 
optimal performance. In Section |lll] we combine the joint 
triangularization with the concept of SIC, to present an optimal 
scheme for two-user digital multicasting that employs linear 
processing of scalar codebooks. In Section |IV] we address the 
problem of transmission of an analog source over two MIMO 
links, where we show that a ratios vector of all-ones except 
for one element creates an equivalent channel over which a 
hybrid digital-analog (HDA) scheme can achieve the optimal 
tradeoff between user distortions; thus we derive the optimal 
performance whenever the channels are such that this ratios 
vector is feasible. We conclude the paper in Section |Vl 

We note that the decomposition may equally be applied 
to cases where two transmitters communicate with a joint 
receiver via MIMO links (a MIMO MAC channel). In this case 
the roles of the U and V matrices in ([T]i are interchanged. An 
application of the decomposition in such a setting is a MIMO 
extension of the "physical network coding" approach to bi- 
directional relays fTTj . This application is beyond the scope 
of this paper, and appears in lfT2ll . 

II. Joint Unitary Triangularization 

In this section we present the joint unitary triangular de- 
composition of two matrices. We start by introducing a few 
notations and recalling known decompositions for a single 
matrix. 

A. Unitary Triangularization of a Single Matrix 

Throughout the work, we will only need to decompose 
matrices which belong to the following class. 

Definition 1 (Proper dimensions): An m x n matrix A is 
said to have proper dimensions if it is full-rank and m > n. 

The singular-value decomposition (SVD, see ifTsl ) of a 
matrix A of proper dimensions m x n, is given by: 

A = [/Tyt , (2) 

where U and V are unitary matrices, and T is an m x n 
(generalized) diagonal matrix, viz. Tij = for i ^ j. 

Remark 1: Throughout the paper, we will assume in all 
decompositions that all the diagonal elements ^ are real 
and non-negative. This is without loss of generality, since any 
phase can be absorbed in U and V. 



The diagonal elements of T are called the the singular 
values (SV) of A; they equal the square-roots of the eigen- 
values of A^A. Since we assumed A to be full-rank, all 
its SVs are strictly positive. We define the SV vector ti{A) 
as the 77 -dimensional vector composed of all SVs (including 
their algebraic multiplicity), ordered non-increasingly. /i(A) is 
unique, i.e., there is no other diagonalization, up to ordering 
and phases of the diagonal. 

Unitary triangularization coined generalized triangular de- 
composition (GTD, see |2|) is a generaUzation of the SVD to 
triangular matrices T. The class of matrices is formally defined 
as follows. 

Definition 2 (Square part): Let A be a matrix of dimen- 
sions m X n where m > n. The square part of A, denoted 
[A], consists of the first n rows of A. 

Definition 3 (Generalized triangular matrix): Let T be a 
matrix of proper dimensions. We call T a generalized triangu- 
lar matrix, if Tij = for i > j, i.e., it has the block structure 

where the square part [T] is upper-triangular. 

As for the SVD, we assume non-negative diagonal elements 
(see Remark [T]), and they are all positive under the full-rank 
assumption. Note that for any unitary triangularization, 

n 

det (A^A) = det (T^T) = (det[r])' = n^^J-j)' ■ 

j=i 

It turns out, that the singular values are an extremal case 
for the diagonal of all possible unitary triangularizations. For 
stating this, we need the following. 

Definition 4 (Multiplicative majorization (see M^JJ-' Let x 
and y be two ?7-dimensional vectors of positive elements. 
Denote by x and y the vectors composed of the entries of 
X and y, respectively, ordered non-increasingly. We say that 
X majorizes y {x>y) if they have equal products: 

n n 

n = n yj ' 

and their (ordered) elements satisfy for any 1 < < 77, 

fe fe 

n ^ n ■ 

In these terms, we can give the condition of the GTD 
121 for the existence of a unitary triangularization: Let A 
be a matrix of proper dimensions m x n and t be an n- 
dimensional vector of positive elements. Then, there exists a 
unitary triangularization of A with diagonal t, i.e., A can be 
decomposed as in ^ with T being some generalized upper- 
triangular matrix with the prescribed diagonal t, if and only 



if the latter is majorized by the singular-values vector of A: Algorithm 1 : Joint Triangularization of Square Matrices 



B. Joint Triangularization with Shaped Diagonal Ratio 

The SVD presented in Section III-AI is essentialy unique. 
Thus, in general, two matrices cannot be jointly diagonalized 
by unitary matrices. Nevertheless, joint triagonalization ([T]i is 
possible. In this section we prove the necessary and sufficient 
conditions for such triangularization, formally defined as fol- 
lows. 

Definition 5 (Joint Unitary Triangularization): Let Ai and 
A2 be matrices of proper dimensions with the same number 
of columns. A joint decomposition: 



Ai = C/iTil/^ 

A2 = U2T2V^ , 



(4) 



is called a joint unitary triangularization if Ui, U2 and V 
are unitary, and Ti and T2 are generalized upper-triangular 
matrices of the same dimensions as Ai and A2, respectively. 

The existence condition for this joint decomposition turns 
out to be similar to that of the GTD (see Section Ill-Al l, where 
the SVs are replaced by generalized singular values, and the 
diagonal of T is replaced by the ratio of the diagonals of 
{Ti,T2). These quantities are defined below. 



Definition 6 (Generalized singular values hl(M ): For any 
(ordered) matrix pair {Ai,A2), the generalized singular 
values (GSVs) are the positive solutions a of the equation 



Let the GSV vector ^i{Ai,A2) be the vector composed of 
all GSVs (including their algebraic multiplicity), ordered non- 
increasingly. 

Remark 2: For matrices of proper dimensions we have n 
GSVs, all positive (recall Remark [Til and finite. For non full- 
rank matrices we still have n GSVs, even if the number of 
finite solutions is smaller. We define a GSV to be infinite, if 
the corresponding GSV of the matrices in reversed order is 
zero. If the number of finite and infinite solutions is smaller 
than n, this suggests that the column rank can be reduced 
without changing the problem; we shall assume the problem 
is in its reduced form. 

Remark 3: When Ai and A2 are square and non-singular, 
fi{Ai,A2) consists of the singular values ofAiA^^ ifTOl . lfT3l . 

Definition 7 (Diagonal ratios vector): Let Ti and T2 be 
two generalized upper-triangular matrices of proper dimen- 
sions nil X n and 7712 x n, respectively, with positive diagonal 
elements. The diagonal ratios vector r{Ti,T2) = '"([Ti], [T2]) 
is the n-length vector whose j-th entry is equal to the diagonal 
ratio Ti-jj/T2-j.j, where Ti-j^k denotes the {j,k) entry of Ti 

(7=1,2)'. 



. Compute B = AiA2^. 

• Apply the GTD to B (for details regarding the im- 
plementation, including Matlab code, see ||2l): 

B = UiRUl , 

with R having a diagonal equal to (the desired diag- 
onal ratio) r{Ti,T2). 

• Decompose UjAi according to the RQ decomposi- 
tion: 

ulAi = Ti^t 
UIA2 = T2V^ . 



Algorithm 2 : Joint Triangularization 



Apply (individual) QR decompositions to Ai and A2: 

A,^Q,R, i = l,2. 

Extract [Ri], the upper square n x n part of Ri. 

Apply Algorithm [T] to [_Ri] and [R2] to obtain a joint 
decomposition 

[Ri] = UifiV^ 

[R2] = U2f2V^ , 

where the diagonal ratio vector of {Ti,T2) is equal to 
the desired diagonal ratio, i.e., r(T'i, T2) = r{Ti,T2). 



> Construct the matrices: 



T = 







where the lower zero block is of dimensions 

(rrii — n) X n, and 

■ U^ 

■^rrii—n 



We are now ready to prove the main result of this section, 
giving the condition for existence of joint unitary triangular- 
ization in terms of majorization (recall Definition |4|i. Since the 
existence proof is constructive, it results with a decomposition 
procedure; this is summarized in Algorithm [T] for the simpler 
case of square matrices, and in Algorithm |2] - for the general 
case. 

Theorem 1: Let Ai and A2 be matrices of proper dimen- 
sions 7771 X n and 7772 x 77, respectively, and r be an 77- 
dimensional vector with positive elements. Then, there exists a 
joint unitary triangularization of (Ai, A2) with diagonal ratio 



vector r, i.e., the matrices can be decomposed as: 

Ai = UiTiV"^ 

A2 = U2T2V^ , 



(5) 



where Ui, U2 and V are unitary, and Ti and T2 are some 
generalized upper-triangular matrices with the prescribed di- 
agonal ratio (the i-th element of r{Ti,T2) equals r^), if and 
only if the latter is majorized by the GSV vector of {Ai, A2): 



/^(Ai,A2) >r r. 



(6) 



Proof: Achievability part. We present here a proof for 
the case when the matrices are square (mi = m2 = n). The 
extension to the general proper-dimension case is relegated to 
Appendix |A] 

In the square case, Ai and A2 must be invertible, being full- 
rank. Define the matrix B = AiA^^. The SV vector of B, 
H{B), coincides with the GSV vector of (^1,^2), /it(Ai, A2) 
(recall Remark[3]i. Thus, it majorizes r, by assumption. Hence, 
according to the GTD (see Section ITl-Al i. the matrix B can be 
decomposed as 



B = UiRUl 



(7) 



where Ui and U2 aie unitary and R is upper- triangular with 
a diagonal vector which equals r. Now, apply RQ decompo- 
sitions to U^Ai (i = 1, 2) to achieve 



U A, 



(8) 



where Ti are upper-triangular with positive diagonal entries 
and Vi are unitary. By substituting (O into (|7]) we have 

UiTiV^V2T2^ul = UiRUl , 



which is equivalent to 



V^V2 = Tf iflTa . 



(9) 



We note that the l.h.s. of (|9]l is unitary, whereas its rh.s. is 
an upper-triangular matrix with positive diagonal entries. An 
equality between such matrices can hold only if both matrices 
are equal to the identity matrix of the appropriate dimensions 
{n X n). Thus, we have 



V'^Vi=V2, 



1, 



Since the diagonal of R is equal to r, this establishes the 
desired decomposition 

Converse part. Assume, in contradiction, that Ai and A2 
can be decomposed as in Q such that fi{Ai, A2) ^ r(Ti, T2). 
Note that fJ,{Ti,T2) = n{Ai,A2). Moreover, [Ti] and [T2] 
are non-singular square matrices of dimensions n x n with 
GSV and diagonal-ratios vectors which are equal to those of 
(Ti,T2), i.e., 

/x([Ti],[r2])=/x(Ti,r2) = MAi,A2), 
r([Ti],[r2])=r(Ti,r2). 



Thus ni[Ti], [T2]) ^ T-([ri], [T2]), which in turn implies that 
the upper-triangular matrix B= [Ti][T2]^^ has a diagonal 
r([Ti],[r2]) and an SV vector /.i([ri], [T2]). But according 
to Weyl's condition fS): 

A2) = [T2]) h r([Ti], [T2]) ^ r(ri,T2) , 

in contradiction to the assumption. ■ 

Remark 4: Note that we did not require the matrices Ti 
and T2 to satisfy Weyl's condition individually, as we did not 
strive to design specific diagonal values but rather prescribed 
ratios between the diagonals of Ti and T2. Indeed, one may 
verify that the resulting matrices in Theorem [T] satisfy Weyl's 
condition individually. 

Remark 5: By the unitarity of Ui, U2 and V, the products 
of fjL and r are equal. Thus, the majorization relations mean 
that the diagonal ratios are always "less spread" than the gen- 
eralized singular values. This is also true for the (individual) 
diagonal values of Ti (i = 1,2) being "less spread" than the 
singular values of Ai. 

Remark 6 (Relation to GSVD): The GSVD HOl can be 
stated in a triangular form Q, with diagonals ratio 
r(Ti,T2) = n{Ai,A2). Thus, the GSVD is a limiting case 
with maximal ratios spread. 

Remark 7 (Relation to GTD): Taking in the joint decom- 
position A2= I yields the GTD of Ai further, the GSV 
vector becomes the S V vector of Ai . The existence condition, 
in turn, reduces to the Weyl condition (see e.g. In this 
sense, the condition in Theorem[T]may be seen as a generaUzed 
Weyl condition for joint triangularization. 

Remark 8 (Relation to the generalized Schur decomposition): 
This decomposition, also called the QZ-decomposition |fT3l . is 
a special case of the joint triangularization (|5]l with Ui = 1/2- 
It can be shown that the diagonal ratio vector induced by 
this decomposition is unique, i.e., requiring that the unitary 
matrices are the same on both sides prohibits shaping of the 
diagonal ratios. 

Remark 9 (Fixed diagonal ratio): Note that any vector fi 
majorizes the vector where all elements equal to its geometric 
mean. As a consequence, for any two matrices there exists a 
joint decomposition with fixed diagonal ratio. We use this fact 
in Section HiTbI 

The joint unitary triangularization (and, as a special case, the 
GTD) can also be relaxed to a block form: 



T, 



( T^■.l,l 








T'i;l,2 
Ti:2,2 



Ti-l.K 
Ti;2,K 

Ti-K.K 




(10) 



where Ti^k^i is a block of dimensions Uk x n/, such that 
'}2/k=i'^k = n (thus the last row of blocks consists of n — m 



all-zero rows). Note that we require corresponding blocks in 
both matrices to be of the same dimensions, except for the 
last row of zero blocks. 

The existence condition can be stated as the following exten- 
sion of Theorem [1] Denote by {ki}fLi the indices satisfying: 

r-fci > rfc2 > • • • > , 



where r/, = "-^jdct {Ti-kM) I det (Ti.k,k)\- Denote by p 
the X-length vector composed of (i.e., the 

absolute values of the ratios between determinants of corre- 
sponding blocks), ordered in non-increasing order of {r^}. 
Further denote by fi the ii'-length vector composed of the 
products of sizes {fc;}[£j of the GSVs of {Ax^A-i) ordered 
non-increasingly, i.e., the first entry of is the product of 
the largest fci GSVs, its second entry is the product of the 
next fc2 GSVs, etc. Then, the desired block triangularization 
is possible if and only if the products of the entries of p and 
of the GSV vector are equal, and for any 1 < A: < 

k k 

1=1 1=1 

III. Transmission Scheme for Multicasting 

In this section we derive an optimal practical communica- 
tion scheme for two-user multicasting over Gaussian MIMO 
BC channels. We start by recalling how the single-user 
Gaussian MIMO capacity may be achieved using multiple 
codebooks (each designed for a scalar AWGN channel) with 
SIC over an equivalent channel, obtained by unitary triangu- 
larization (as in Section Hi- Al l. 

A. SIC for MIMO Channels 

The exposition below follows that of the universal matrix 
decomposition (UCD) [14], which is in turn based upon the 
derivation of the MMSE version of V-BLAST, see, e.g., ifTSl . 
Later in the paper we take the triangularization to be one 
which is simultaneously good for two users. This is suppressed 
for now. We assume throughout the paper perfect channel 
knowledge everywhere. 

We consider a point-to-point (complex) MIMO channel: 



y 



Hx 



(11) 



where x is the channel input of dimensions A't x 1 subject to 
an average power constraint y is the channel output vector 
of dimensions Nr x 1; H is the channel matrix of dimensions 
Nr X Nt and z is an additive circularly-symmetric Gaussian 
noise vector of dimensions Nr xl. Without loss of generality, 
we assume that the noise elements are mutually-independent, 
identically-distributed with unit variance. 
The capacity of this channel is given by 



C{H,P) 



max/(i7, Ca;), 
Cx 



(12) 



^These conditons ai'e similar to the majorization conditions of Definition |4] 
up to the ordering which is done w.r.t. rj. and not those of the entries of p. 

^Alternatively, one can consider an input covariance constraint Cx — 
E [tccctj ^ C, where by Ci :< C2 we mean that the matrix (C2 — Ci) 
is positive semi-definite. 
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Fig. 1. Multiple-codebook transmitter with linear precoding 



where the maximization is over all channel input covari- 
ance matrices Cx > 0, subject to the power constraint 
trace (Cjc) < F, andE 



KB, Gx) = log dot (/ + HCxU^) 



(13) 



We may interpret I{H,Cx) as the maximal mutual informa- 
tion that can be attained using an input covariance matrix Cx, 
which is achievable by a Gaussian input x. 

In order to achieve a rate approaching this mutual informa- 
tion, optimal codes of long block length are needed. However, 
as pointed out in the introduction, we take an approach 
which decouples the signal-processing aspects from these of 
coding. We thus omit the time index throughout the paper; for 
example, when referring to an input vector x, we mean the 
input at any time instant within the coding block. In a practical 
setting using encoder/decoder pairs of some given quality, one 
may easily bound the error probability of the scheme using the 
parameters of the codes. 

When coding over a domain different than the input domain 
(e.g., time or space), one may start with a virtual input vector 
X, related to the physical input by the linear transformation: 



X = C^^Vx. 



(14) 



We form the vector x, in turn, by taking one symbol from each 
of Nt parallel codebooks, of equal power l/Nf. The matrix 
V is a unitary linear precoder See Figure [T] 

Recalling the GTD (see Section Ill-Al l, one may suggest to 
choose V by applying a unitary triangularization to 



F = HC. 



1/2 



(15) 



After the receiver applies the transformation U\ it is left 
with an equivalent triangular channel T, over which it may 
decode the codebooks using SIC. Unfortunately, while this 
"conserves" the determinant of HCxH\ it fails to do so 
when the identity matrix is added as in the mutual information 
I{H,Cx) O- Thus, this is optimal in the high SNR Umit 
only, and an MMSE variation is needed in general, as next 
described. 

We start by applying a unitary triangularization (as in 
Section III-Ab to an augmented matrix: 

F 
In, 



= G = UTV\ 



(16) 



''All logarithms in this paper are to the base 2. 

^The square root of a Hermitian positive-definite matrix A, denoted by 
A^/'^, is defined as the matrix satisfying: A = A^/'^ (A'^/^)^. 




Fig. 2. SIC-based receiver. 



where is the identity matrix of dimensions Nt. Note 
that, by construction, G is of proper dimensions, regardless 
of the dimensions and rank of the channel matrix H. That 
is, it has dimensions {Nt + Nr) x Nt and is full- rank. The 
square matrices U and V have dimensions Nt + Nr and Nt, 
respectively. This allows to decompose the total rate into terms 
associated with the diagonal values of the matrix T, as follows: 

I{H, Cx) = logdet {In,^ + FF^) (17) 
= logdet(/Ar, (18) 
= logdet (G^G) (19) 

Nt Nt 

where ( fTTI i follows by the definitions (fljT l and (fTSl l. ( fTSl l is 
justified by Sylvester's determinant Theorem (see e.g. lfT6l ). 
(fT9T l is a direct application of the definition (fTSI ), and in ( l20l i 
we define i?j = log(rj.j)^. Using the matrices obtained by 
this decomposition, the following scheme communicates scalar 
codebooks of rates {Rj}- 

The transmitted signal is formed using (fT4l l. At the receiver, 
we use a matrix W, consisting of the upper-left Nr x Nt block 
of [/: y = W^y. This results in an equivalent channel: 

y = W''{FVx + z) = W^FVx + W''z ^fx + z. (21) 

Note that since W is not unitary, the statistics of z = W'^z 
differ from those of z. We denote the covariance matrix of the 
equivalent noise by = WW^. Finally, SIC is performed, 
i.e., the codebooks are decoded from last {j = Nt) to first 
ij ~ 1), where each codebook is recovered from: 



Nt 
l=J + l 



(22) 



where xi is the decoded symbol from the l-th codebook; see 
Figure |2] Assuming correct decoding of "past" symbols, i.e. 
xi ~ xi for all I > j, the scalar channel for decoding of the 
j-th codebook is given by: 



(23) 



1=1 



Since T is not triangular, the second term in this scalar channel 
(resulting from elements below the diagonal of T) acts as in- 
terference. The signal-to-intereference-and-noise ratio (SINR) 
is given by: 



Sj = Var 



V x^' 



(24) 



where C^-ij denotes the entry of C^- 

The following, which is equivalent to Lemma III. 3 in lfT4ll . 
shows optimality of the scheme. 

Proposition 1: For any channel H and input covariance 
matrix Cx, the SINRs Sj ( l24l i of the transmission scheme 
above satisfy: 



log{l + S,) 



(25) 



where the rates Rj are given by (l20l i. 



This completes the recipe for a digital transmission scheme 
which achieves I{H, Cx)'- For a given input covariance matrix 
Cx, choose the individual codebook rates to approach {Rj}, 
the sum of which equals the mutual information afforded by 
the MIMO channel (T3[ . By Proposition [T] the successive de- 
coding procedure will succeed with arbitrarily low probability 
of error for these rates (asymptotically for high-dimensional 
optimal scalar AWGN codes). Taking Cx be the covariance 
matrix maximizing ( fT2] i. capacity can be achieved. 

The above exposition proves the optimality of the "scalar 
coding" approach - the combination of scalar AWGN code- 
books, linear processing, and SIC. This approach offers re- 
duced complexity and easy-to-analyze performance when the 
channel is known at both ends ("closed loop"). Indeed, special 
cases of this approach have been suggested and used. In partic- 
ular, using the SVD results in a diagonal equivalent channel 
matrix T, establishing parallel virtual AWGN channels (no 
SIC is needed), see ifTSl . Other schemes, such as generalized 
decision feedback equalization (GDFE) and Vertical Bell- 
Laboratories Space-Time coding (V-BLAST), see |[T], ifTTl . 
are based on the QR decomposition. These do not require 
linear precoding, i.e., V ^ I. The UCD lfT4l uses both a 
linear precoder and SIC, in order to achieve T with diagonal 
elements that are all equal. 

Remark 10 (Number of codebooks): If desired, one may 
work with any number of codebooks above Nt, as stated in 
To see that, add "virtual transmit antennas" with corre- 
sponding zero channel gains. The capacity remains unchanged, 
and the optimal channel input covariance matrix will not 
allocate power to these "antennas". The number of codebooks 




Rx 1 Rx 2 

Fig. 3. Two-user MIMO BC channel. Even though both users may have similar 
channel quality, the actual channel matrices Hi and H2 differ. 

is equal to the number of antennas, including the additional 
virtual ones. 

All of these schemes have significant advantages over direct 
capacity-achieving implementation for MIMO channels. Such 
high-complexity schemes, e.g., using bit-interleaved coded 
modulation (BICM) in conjunction with sphere detection, 
essentially require the same resources as if working in an 
"open loop" mode. Thus, the complexity involved is similar to 
that required for approaching the isotropic mutual information 
of the channel, when only the receiver knows the channel. We 
note that this advantage comes at the price of suffering from 
error propagation between the codebooks. This effect has been 
analyzed and simulated in many works, see e.g., ifTSl . |fT9l . 

We conclude this section by pointing out a simple extension 
to a unitary transformation which induces a block-traingular 
matrix rather than a strictly triangular one. That is, if the matrix 
R in (fTSI l is of the block generalized upper-triangular form 
( [Tol l, where the block Ti-^k,i is of dimensions Nk x Ni, such 
that Y.k=i = Nf In that case, we employ K < Nt codes 
in parallel, each over an equivalent Nk x Nk MIMO channel, 
achieved by "block-SIC": 

i 

y'^=J2Tj,iXi+z,, j K, (26) 

1=1 

where Tjj is of dimensions Nj x Ni. Seen as Gaussian 
MIMO channels (i.e., seeing residual interference as noise) 
we achieve, as an extension to Proposition [T] a rate 

i?, =logdct(T,-,(r,-,)t) (27) 

over each such block channel. 

B. Optimal Two-User Scheme 

We now derive an optimal practical communication scheme 
for two-user multicasting. 

The two-user Gaussian MIMO broadcast (BC) channel has 
one transmit and two receive nodes, where each received signal 
is related to the transmitted signal through (see also Figure |3]i: 

y.^ = H,x + z,, (28) 

where x is the channel input of dimensions Nf x 1 subject 
to an average power constraint pj^ is the channel output 

*Again, alematively, one can consider an input covariance constraint Cx = 
E [cccct] ^ C. 



vector of decoder i (i = 1,2) of dimensions Nr x 1; Hi is 
the channel matrix to user i of dimensions N^^"^ x Nt and Zi 
is an additive circularly-symmetric Gaussian noise vector of 
dimensions N^'' x 1, where again, without loss of generality, 
we assume that the noise elements are mutually-independent, 
identically-distributed with unit variance. 

This channel has received much attention over the past 
decade. Unlike the single-input single-output (SISO) case, the 
Gaussian MIMO BC channel is not degraded. Nevertheless, 
capacity regions were established for some scenarios, such 
as private-messages only, and for a common message with a 
single private message, and bounds were derived for others, 
see II20I - II24I and references therein. 

We focus our attention on the multicast (common-message) 
problem, the capacity of which is long known to equal the 
(worst-case) capacity of the compound channel ll25l - ll27l . with 
the compound parameter being the channel matrix index: 

CiHi ,H2,P)= max min I{H, ,Cx), (29) 
^x ^— -li^ 

where maximization is over all channel input covariance matri- 
ces Cx > 0, subject to the power constraint trace (Cx) < P 
and the MIMO mutual information I{H, Cx) was defined in 

We wish to use a scalar-coding approach, as applied to 
the point-to-point setting in Section ITlI-AI Indeed, the private- 
message MIMO BC capacity can be achieved by scalar-coding 
(in this case dirty-paper coding) techniques; see, e.g., 1281 . In 
the presence of a common message, however, to our knowl- 
edge, no scalar capacity-approaching coding solutions are 
known. QR-based schemes fail, since requiring the individual 
streams to be simultaneously decodable at all the receivers 
implies that the rate per stream is governed by the smallest 
of the corresponding diagonal elements (in the resulting two 
matrices), (potentially) inflicting an unbounded rate penalty. 
Adapting SVD to this scenario has an additional problem: The 
decomposition requires multiplying by a channel-dependent 
(unitary) matrix at the transmitter, which prevents from using 
this decomposition for more than one channel simultaneouslyO 
As a result of these difficulties, other techniques were pro- 
posed, which are suboptimal in general, see, e.g., 1291 , l30l . 

In this section, we present an optimal successive-decoding 
(low-complexity) scheme for a two-user common-message 
Gaussian MIMO BC channel. Specifically, the proposed 
scheme is based upon SIC and good scalar AWGN codes, 
in conjunction with the following special case of the decom- 
position in Theorem [T] 

Corollary 1: Let Ai and A2 be two matrices of proper 
dimensions mi x n and 7712 x n, respectively, satisfying 

dot [AiA^^ > dot (A2AI^ . (30) 

'indeed, the GSVD allows to use a single transformation for two different 
channels at one of the ends, but for each virtual parallel channel it yields a 
different gain for each user, thus not solving the inefficiency mentioned above. 
In fact, using GSVD may result in worse performance than using a QR-based 
receiver without any transformation at the transmitter since the spread of the 
diagonal ratio is maximal, see Remark |6] 



Then there exists a joint triangularization ^ where 

Tiyj>r2;jj, Vj = l,...,n. 



Proof: An equivalent condition to ( l30b is that the product 
of the entries of /j, = A2) is at least one. Let /2 > 1 be 

the geometrical mean of fi, and let the vector r be the same 
size as fi, with all the elements equal to ji. By construction, 
fJ- h f, thus by Theorem [T] there exists a joint triangularization 
with this ratio. Consequently, there exists a decomposition 
such that for all elements Ti.j j = P'T2-j.j > T2-jj- ■ 

Remark 11 (Admissible diagonal ratios): The proof sug- 
gests that the diagonal ratios vector be made uniform. 
This is always possible, but is not the only choice (unless 
I{Hi,Cx) = I{H2,Cx))- 

The results above defines a communication scheme in the 
following way. For the channels Hi and H2, let Cx be a 
capacity-achieving input covariance matrix, and assume with- 
out loss of generality that I{Hi, Cx) > I{H2, Cx)- Define the 
augmented matrices Ci and C2 as in (fT6] l. By Corollary [T] 
there exists a joint triangularization (|5]l such that each diagonal 
element of [Ti] is at least equal to the corresponding element 
of [T2]. On account of ([TTll-dlOll we have that: 

Nt Nt 

I{H2,Cx)^J2^og{T2,,,)^^Y.R^- 

This rate can be approached using SIC at each receiver as 
in the point-to-point case of Section IIII-AI Specifically, x is 
formed from Nt codebooks of rates {Rj} and power 1/Nt 
each. The transmitted vector is given by the linear precoding 
(O and receiver i performs the linear transformation i2l[ and 
SIC (|22li (substituting Ui and Ti for U and T, respectively). 
Now Proposition [1] guarantees correct decoding of all code- 
books for receiver 2. Since in receiver 1 each SINR can only 
be greater, it will be able to decode as well. 

Remark 12 (Private messages): If, in addition to the com- 
mon message intended to both users, there are private mes- 
sages (messages intended for individual users), superposition 
may be used. That is, part of the transmit power is dedicated 
to the private messages. Then, for the purpose of the com- 
mon message, the transmission for the private messages is 
considered as noise. This approach was shown in ll24l to be 
capacity-achieving in the presence of a single private message, 
and under some conditions on the rate and power - also in the 
presence of two private messages (even when these conditions 
do not hold, superposition gives the best known performance). 
The scheme presented in this section may be used for the 
common-message layer of these superposition schemes as 
well. Interestingly, in that case we would have interference 
cancellation both at the encoder (dirty-paper coding of the 
private messages) and at the decoders (SIC of the common 
message). 



IV. HDA Transmission for Source Multicasting 

In this section we turn from the purely digital setting to a 
joint source-channel coding (JSCC) problem, where we wish 
to multicast an analog source to two destinations, where each 
destination should enjoy reconstruction quality according to 
the capacity offered by its channel. 

The transmission of a source over a BC channel is one 
of the main applications of JSCC. In this setting, JSCC may 
be greatly superior to transmission based upon source-channel 
separation. In a classical example, a white Gaussian source 
needs to be transmitted over a two-user AWGN BC channel, 
with one channel use per source sample, under mean-squared 
error (MSB) distortion. Analog transmission achieves the opti- 
mal performance for each user as if the other user did not exist 
[Un . In contrast, the separation-based scheme (concatenation 
of successive-refinment quantization and broadcast channel 
coding) yields a tradeoff, where if we wish to be optimal 
for the user with worse signal-to-noise ratio (SNR), then both 
users have the same distortion, while optimality for the user 
with better SNR means that the distortion for the other user 
is trivial (equals the source variance). See, e.g., Il32l App. A]. 

We focus on the transmission of an i.i.d. circularly- 
symmetric Gaussian source S to two destinations over a 
MIMO BC channel ( |28] |. with one channel use per source 
sample. We measure the quality of the reproductions Si using 
the MSB distortion measure. Thus, we wish to maximize the 
tradeoff between the signal-to-distortion ratios (SDRs), defined 
as 

SDR, 4 ^^^^^^ , , ^ = l,2. (31) 
Var f Si - S j 

The achievable SDR region S{Hi,H2) is defined as the 
closure of all pairs which can be achieved by some encoding- 
decoding scheme. 

This general problem of describing S{Hi,H2) has not 
received much attention. Nevertheless, in the special cases 
of diagonal or Toeplitz channel matrices, it reduces to the 
better known problem of transmission over a colored and/or 
bandwidth-mismatched Gaussian BC channel, for which dif- 
ferent schemes which outperform the separation approach have 
been presented, see e.g. Il32l - ll36l . However, even for these 
cases optimality claims are not abundant. In f35\, Kochman 
and Zamir show asymptotic optimality for high SNR, where 
the channels have the same bandwidth as the source, and one 
user enjoys a better channel than the other at all frequencies. 
Taherzadeh and Khandani ||36| show that optimality in the 
slope sense (weaker than high-SNR asymptotic optimality) is 
possible for white channels where their bandwidths (BW) are 
integer multiples of the source BW. A similar slope argument 
applies to the general MIMO case as well. 

A simple outer bound on the achievable SDR region is given 
by the following. 

Proposition 2: S{Hi, H2) C S{Hi, H2), where the bound- 



ing region S{Hi,H2) is given by: 

(J {(SDRi,SDR2) : log(SDR,) < I{H„Cx)] , 

where the union is over all matrices Cx > such that 
trace (Ca;) < P, and where the MIMO mutual information 
I{H,Cx) was defined in dlj] ). 

The proof follows that of the classical source-channel converse 
1371 . taking into account that both users share the same 
channel input. 

In Section IIV-AI we find sufficient conditions for achieving 
points on the boundary of this region. Then, in Section IIV-BI 
we present, for the case of two transmit antennas, a simple 
sufficient condition such that all of the region S{Hi,H2) 
can be achieved. Unlike previous work, this proves strict, 
optimality, non-asymptotic in the channel SNR; it applies to 
some cases of color and bandwidth mismatch, although not to 
the white BW-expansion case. 

A. Optimality by HDA Transmission 

We give a constructive achievability proof, which combines 
a hybrid digital-analog (HDA) scheme by Mittal and Phamdo 
ll33l with the joint triangularization approach; the optimum is 
achievable whenever the diagonal ratio vector can be shaped 
according to the needs of the HDA scheme. In order to under- 
stand the function of the HDA scheme, we need to consider the 
following related scenario. In a JSCC multicasting problem as 
above, the BC channel is SISO, i.e., Nt = Nr ^ 1, in which 
case the channel matrices reduce to scalars hi. However, in 
addition, the transmitter node may send some data to the users 
(identical for both) over a digital channel of rate i?digitai bits per 
use of the BC channel. According to the aforementioned HDA 
approach, the source is first quantized according to the rate of 
the digital channel and the quantization error is then sent in an 
analog manner over the analog channel. Thus, the distortion 
is equal to that of the (analog) quantization error, and hence 
optimality (optimum distortion over each channel) is achieved 
simultaneously, as implied by the following proposition. 

Proposition 3: In the setting above, the optimal perfor- 
mance is given by: 

SDRi = (1 + hjP) 2^'««" . 



Proof: We use a vector quantizer which decomposes each 
sample of the Gaussian source S as 



S = S + Q. 



(32) 



The first term is the quantized source, while the second is 
the quantization error. By quadratic-Gaussian rate-distortion 
theory (see e.g. 1381 ). in the limit of high quantizer dimension, 
a quantizer of rate i?digitai may achieve: 



-Rdigital — log 



/ Var (S) 
VVar(Q) 



which is equivalent to 

SDRd,gUai^|^ = 2«-^". 
Var (Q) 

Now the quantizer output representing S is sent over the digital 
channel, thus S can be reconstructed exactly. Given S, the 
reconstruction error of S becomes that of Q. That is. 



SDR, = 



Var (S) 



Var (Q) 



SDRdigital 

Var [Qi - Qj Var [Q^ - Q j 

= SDRanalog.i SDRdigital , 

where Qi is the reconstruction of Q at receiver i using the 
SISO BC channel. Finally by l3TI . analog transmission of Q 
achieves SDRanaiog.j = 1 + hjP, yielding the desired SDRs. 
No scheme can achieve better performance, by considerations 
similar to those leading to Proposition ID ■ 
We use this HDA approach to prove the following. 

Theorem 2: Denote by //, the GSV vector of the augmented 
matrices (fT6] l of the channels with some input covariance 
matrix Cx- If 

Nt Nt-1 

\{^^,<l<\{^^,. (34) 

then any pair (SDRi,SDR2), such that logSDR^ < 
I{Hi,Cx), is achievable. 

Proof: By Theorem [T] the condition (|34] l implies that 
there exists a joint unitary triangularization of the augmented 
channel matrices ( fTSI l with diagonal ratios vector which is all 
one except for one element. The diagonal of Ti can thus be 
made to satisfy 



v.? = 2, 



If we were to send digital data over the MIMO BC channel 
using this particular triangularization, then by (l20b we could 
send over these Nt — 1 channels a rate of: 



Nt Nt 
^digital — ^ Rj = ^ log t 

i=2 j=2 



(33) 



This does not change if we replace, in the transmission 
scheme, ii by a different signal of the same variance P/Nt 
(since in the SIC process, the decoding of each codebook 
only depends on decoding of codebooks with higher index). 
Furthermore, regardless of the signal xi, if the codebooks of 
subchannels 2,. . . ,Nt are correctly decoded then receiver i 
can obtain the equivalent channel (recall (|23]l): 

which has, by Proposition [T] a signal-to-noise ratio of 

SNRanalog.i = {Ti-l,l)'^ — 1 ■ 

At this stage we have turned the MIMO BC channel into the 
combination of a digital channel of rate i?digitai and a SISO 



BC channel of signal-to-noise ratios SNRanaiog.i = 1, 2). On 
account of Proposition [3] one can achieve 

log SDR, = l0g(l + SNR,n,|og) + i?digital 

3 = 1 

= I{H,,Cx), « = 1,2, 

where the last equality is on behalf of (fT7]i-(l20]l. ■ 

Remark 13: In fact, full triagularization is not needed. It 
would have been sufficient to achieve a block-triangular struc- 
ture, where the interference between the last Nt — 1 channels 
is arbitrary (conserving the determinant of the block in Ti). 
However, as indicated at the end of Section ITl-BI this does not 
allow to relax the condition (|34] |: a block corresponding to the 
last Nt — I channels may always be replaced by a triangular 
block with a constant diagonal (see Remark|9]l. Moreover, the 
triangular form is advantageous from the point of view of 
complexity (see Section Hl-Bb . 

Theorem ID does not imply that S{Hi,H2) is fully achiev- 
able, since the conditions on the GSVs should be verified sep- 
arately for each optimal input covariance matrix Cx- However, 
in the sequel we show that for Nt < 2, the condition can be 
verified directly on the channel matrices Hi and H2- Similarly, 
if the channel matrices are of (any) proper dimensions, then 
at the limit of high SNR (as the choice Cx = I becomes 
optimal), the GSVs of the augmented matrices approach those 
of {Hi, H2), thus the condition may be applied to the channel 
matrices directly, verifying achievability of the whole region 
at once. 

B. Two Transmit Antennas 

In this section we consider the case where Nt = 2. In that 
case, the GSV vector fi{Hi,H2) has two elements. We say 
that the GSV vector is mixed, if one of the elements is at least 
one, and the other is at most one. The following is proven in 
Appendix |B] 

Lemma 1: Let Hi and H2 be two matrices of proper 
dimensions, with two columns, and define the augmented 
matrices (as in (fT6]l): 



H,C^'^ 

h 



where C is some Hermitian positive semi-definite matrix. Then 
if fj,{Hi,H2) is mixed, fi{Gi,G2) is mixed as well. 

We use this lemma and Theorem |2] to prove the following. 

Corollary 2: Let Hi,H2 be channel matrices with Nt = 2. 
If fi{Hi,H2) is mixed, then the bounding region S{Hi,H2) 
of Proposition |2] is achievable. 

Proof: For any point on the boundary of S{Hi, H2), let 
/2 be the GSV vector of the augmented matrices with the 
corresponding Cx- By Lemma [T] /i, is mixed as well. Now if 
the product of is at most one, we can apply Theorem |2] If 



it is greater than one, we switch the indices between Hi and 
H2, and then apply Theorem |2] ■ 

Unfortunately, this result cannot be generalized to the case 
Nt > 2: Although at any dimension it remains true that the 
number of GSVs smaller or greater than one is not changed 
by the augmentation, this property does not hold for products 
of GSVs as required for applying Theorem |2] 

In order to demonstrate this result, consider the simplest 
example of the diagonal two-input two-output case|l 



ai 








i = l,2. 



(35) 



The bounding SDR region S{Hi,H2) now becomes; 
U |(SDRi,SDR2) : (36) 



0<7<1 



SDR, < 1 



V)(l + IAr(l-7)^)}. 



In this expression, 7 is the portion of the transmit power sent 
over the first band. 

We point out a few special cases where points on the surface 
of this region are achievable by known strategies. 

1) No BW expansion: Analog transmission. If one of the 
bands has zero capacity, e.g., pi = P2 = 0, ( |36] | reduces 



to: SDR, < 1 



P, which is achievable via analog 



transmission ||3T1 . If for each user a different band is 
usable, e.g., ai = P2 = 0, any transmission (digital or 
analog) which is orthogonal between users is optimal. 

2) Equal SDKs: Digital transmission. A point on the 
boundary which satisfies SDRi = SDR2 may be 
achieved by quantizing the source and then using a digital 
common-message (multicasting) code for the BC channel 
(as described in Section UlI-BI 

3) One equal band: HDA transmission. If for one of the 
bands the gains are equal, e.g., |/3i| = |/32| = we can 
use that band for digital transmission with rate i?digitai = 
log(l + /3^P) and then apply Proposition [3] to achieve the 
bound (|36] |. 

Using network modulation, we can extend the HDA trans- 
mission (case 3 above), by transforming a diagonal channel 
where none of the gains is equal between users, to an equiv- 
alent triangular channel where for one of the bands the gain 
is equal. This can be done under the condition ( |34] |. which 
specializes to (allowing to swap roles between matrices): 



|aip > |a2p and \Pi\^ < |/32 



(37) 



or vice versa. This is an "anti-degradedness" condition: No 
user can have better SNR on both bands. This condition 
subsumes all the cases mentioned above. It is not known 
whether it is a necessary condition, but at least for the case 
where both channels are white (ai = f3i), it was shown in ||39l 
that simultaneous optimality is not possible. 

'^Being diagonal, this channel may be obtained from a single-input single- 
output Gaussian inter-symbol interference channel which has a two-step 
frequency response, by applying the discrete Fourier transform. 




- Network modulation (optimal) 
■ Separation 
"Naive" HDA 



SDR, [dB] 

Fig. 4. Performance comparison for ai = 1, /3i 



10,02 =132 = 2,P = 1. 



Figure |4] shows a numerical evaluation of performance for 
some gain values. It can be appreciated that the optimal 
performance imposes almost no tradeoff between users. Indeed 
the only tradeoff comes from the need to choose the same Cx- 
Thus, in the limit of high SNR, both users attain their optimal 
single-user performance. For comparison, we show the per- 
formance of a separation-based scheme, where a successive- 
refinement source code is transmitted over a digital broadcast 
channel code, as well as that of a "naive" HDA scheme, where 
transmission is digital over one band and analog over the other. 

V. Conclusions 

This work considered the problem of transmitting analog 
(source) and digital information over MIMO communication 
networks. For this, we proposed a new decomposition which 
triangularizes two matrices simultaneously using the same 
unitary transformation on one side, and different - on the 
other, which allowed, in turn, to accommodate a modulation 
to the network and the desired application. We then showed 
how using one version of this decomposition it is possible to 
construct a practical capacity-achieving scheme for two-user 
multicasting (which may also be useful in different relaying 
problems), whereas a different version of this decomposition 
becomes useful when transmitting the same (analog) source 
over two different MIMO channels. In the latter case, this 
technique allowed deriving new achievable regions, which 
proved optimal for a class of channels. 

Appendix A 

Joint Decomposition for Non-Square Matrices 

In this Appendix we complete the proof of the direct part 
of Theorem [T] by considering the general proper-dimension 
case. 

We start by decomposing Ai using the QR decomposition: 



QiRi 



1,2, 



where Qi is unitary and Ri is generalized upper-triangular 
with non-negative diagonal entries. Moreover, the GS V vectors 
of (^1,^2) and (i?i, i?2) are equal, A2) = i?2), 

since Ai and Ri are equal up to a unitary transformation on 
the left. 



Since Ai is full-rank and rrii > n, the diagonal elements 
of Ri are all (strictly) positive and the entries on its lower 
(m, — n) rows are all zeros. Note that the square parts 
and [R2] are non-singular, with ^t([i?i], [R2]) = At(-Ri, R2) = 
/x(^i,^2). Thus /x([i?i],[i?2]) > r(Ti,r2). Invoking the 
proof for the square case in Section III-BI we may decompose 
[i?i] and [i?2] simultaneously as: 

[R2] = U2f2V^ , 

where r{Ti,T2) = r{Ti,T2). Now, construct the augmented 
unitary matrices Pf. 



^niA—r 



and the generalized triangular matrices Ti of dimensions 

m," X n: 







Thus, we arrive at the desired decomposition of Ai and A2 
with [/, = Q,P, and V. ■ 

Appendix B 
Proof of Lemma[T] 

We first Note that if C is singular, then at least one of 
the elements of /x(Gi, G2) equals one, suggesting the latter is 
mixed. Thus, we are left with the case of a non-singular matrix 
C. Let F, = H,C^/^ for i = 1,2. We claim that /x(i^i,i^2) 
must be mixed. This is true, since for a non-singular matrix C: 
/^(i^i, F2) = fj,{Hi,H2). It is left to show that if fi{Fi,F2) is 
mixed, then so is /^(Gi, G2). To that end, define the quadratic 
functions: 



p{x) = dot {^fIFi - xF^F2 
q{x) = dct (g\Gi - xGIG2 



By Definition|6] the roots of p{x) and q{x) equal the square of 
the elements of fi{Fi,F2) and /x(Gi,G2), respectively. Thus 
it suffices to prove that if the roots of p{x) are not on the same 
side of 2; = 1, then so are the roots of q{x). By the positive 
semi-definitiveness of Fi and Gi, both functions are convex-lj 
with p(0), (7(0), p(oo), (7(00) > 0. By the assumption on the 
roots of p{x), it must be that p{l) < 0. But since 



G-^ Gi 



G2G2 



fIfi 



FIF2 . 



we have that q{l) = and thus ^(1) < 0. Finally, a 

convex-lj continuous function which is non-negative at .t = 
and for x 00, and is non-positive at .t = 1 cannot have 
both roots on the same side of 1. ■ 
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